On reducing sampling variance in covariate shift using control variates

نویسندگان

  • Wouter M. Kouw
  • Marco Loog
چکیده

Covariate shift classification problems can in principle be tackled by importanceweighting training samples. However, the sampling variance of the risk estimator is often scaled up dramatically by the weights. This means that during cross-validation when the importance-weighted risk is repeatedly evaluated suboptimal hyperparameter estimates are produced. We study the sampling variances of the importance-weighted versus the oracle estimator as a function of the relative scale of the training data. We show that introducing a control variate can reduce the variance of the importance-weighted risk estimator, which leads to superior regularization parameter estimates when the training data is much smaller in scale than the test data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reducing the Variance of Likelihood Ratio Greeks in Monte Carlo REDUCING THE VARIANCE OF LIKELIHOOD RATIO GREEKS IN MONTE CARLO

We investigate the use of Antithetic Variables, Control Variates and Importance Sampling to reduce the statistical errors of option sensitivities calculated with the Likelihood Ratio Method in Monte Carlo. We show how Antithetic Variables solve the well-known problem of the divergence of the variance of Delta for short maturities and small volatilities. With numerical examples within a Gaussian...

متن کامل

The Efficiency of Variance Reduction in Manufacturing and Service Systems: The Comparison of the Control Variates and Stratified Sampling

There has been a great interest in the use of variance reduction techniques VRTs in simulation output analysis for the purpose of improving accuracy when the performance measurements of complex production and service systems are estimated. Therefore, a simulation output analysis to improve the accuracy and reliability of the output is required. The performance measurements are required to have ...

متن کامل

Safe and Eeective Importance Sampling

We present two improvements on the technique of importance sampling. First we show that importance sampling from a mixture of densities, using those densities as control variates, results in a useful upper bound on the asymptotic variance. That bound is a small multiple of the asymptotic variance of importance sampling from the best single component density. This allows one to beneet from the g...

متن کامل

Generating Antithetic Random Variates in Simulation of a Replacement Process by Rejection Method

When the times between renewals in a renewal process are not exponentially distributed, simulation can become a viable method of analysis. The renewal function is estimated through simulation for a renewal process simulation for a renewal process with gamma distributed renewal times and the shape parameter a > 1. Gamma random deviates will be generated by means of the so called Acceptance Rejec...

متن کامل

A Multilevel Approach to Control Variates

Control variates are a popular technique for reducing the variance of Monte Carlo estimates. Recent literature has enlarged the set of potentially useful control variates. Still, finding an control variate that efficiently reduces estimation error can be a challenging task for which the theoretical literature provides little guidance. In this note we show by theory and example how to construct ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1710.06514  شماره 

صفحات  -

تاریخ انتشار 2017